Native Language Identification Across Text Types: How Special Are Scientists?
نویسندگان
چکیده
منابع مشابه
Native Language Identification on Text and Speech
This paper presents an ensemble system combining the output of multiple SVM classifiers to native language identification (NLI). The system was submitted to the NLI Shared Task 2017 fusion track which featured students essays and spoken responses in form of audio transcriptions and iVectors by non-native English speakers of eleven native languages. Our system competed in the challenge under the...
متن کاملGeneralization in Native Language Identification: Learners versus Scientists
English. Native Language Identification (NLI) is the task of recognizing an author’s native language from text in another language. In this paper, we consider three English learner corpora and one new, presumably more difficult, scientific corpus. We find that the scientific corpus is only about as hard to model as a less-controlled learner corpus, but cannot profit as much from corpus combinat...
متن کامل(Non)native Language Teachers’ Cognitions: Are They Dichotomous?
In view of native/nonnative language teacher dichotomy, different characteristics have been assigned to these 2 groups. The dichotomy has been the source of different actions and measures to clarify the positive and negative points of being (non)native teachers. In recent years, many researchers have revisited this dichotomy. The challenge to the dichotomy can be a source of motivation to explo...
متن کاملParser evaluation across text types
When a statistical parser is trained on one treebank, one usually tests it on another portion of the same treebank, partly due to the fact that a comparable annotation format is needed for testing. But the user of a parser may not be interested in parsing sentences from the same newspaper all over, or even wants syntactic annotations for a slightly different text type. Gildea (2001) for instanc...
متن کاملFrom Language to Family and Back: Native Language and Language Family Identification from English Text
Revealing an anonymous author’s traits from text is a well-researched area. In this paper we aim to identify the native language and language family of a non-native English author, given his/her English writings. We extract features from the text based on prior work, and extend or modify it to construct different feature sets, and use support vector machines for classification. We show that nat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Italian Journal of Computational Linguistics
سال: 2016
ISSN: 2499-4553
DOI: 10.4000/ijcol.348